ANICK-TYPE RESOLUTIONS 
AND CONSECUTIVE PATTERN AVOIDANCE 



o 

(N 



VLADIMIR DOTSENKO AND ANTON KHOROSHKIN 



Abstract. For permutations avoiding consecutive patterns from a given 
^ , set, we present a combinatorial formula for the multiplicative inverse of 

| ■ the corresponding exponential generating function. The formula comes 

from homological algebra considerations in the same sense as the corre- 
sponding inversion formula for avoiding word patterns comes from the 
well known Anick's resolution. 



in 



1. Introduction 



o 

u 

The purpose of this paper is to present a formula for the multiplicative in- 
verse for the formal series enumerating permutations avoiding the given set 
of consecutive patterns. There are various formulas of that sort, one based 
on a version of inclusion-exclusion principle, namely the cluster method 
of Goulden and Jackson [8, 18], and another more recent and much more 
compact, based on the symmetric functions method of Mendes and Rem- 
mel [17]. Our formula should be thought of as a mixture of these two: on 
■ one hand, it is based on combinatorial data somewhat similar to Goulden- 

\ Jackson clusters, on the other hand, it takes care of most cancellations which 

O ■ more resembles what happens in [17]. 

In the case of pattern avoidance in words, a similar result was obtained by 
Anick [1]; however, the emphasis of his paper was on applications to homo- 
logical algebra, and it never attracted attention of specialists in enumerative 
combinatorics. Consequently, a rather straightforward generalization of his 
method to the case of consecutive patterns in permutations has never been 
discovered. We present such a generalisation in this paper. Our intuition 
here comes from homological algebra as well; our construction is based on 
free resolutions of Anick type for shuffle algebras [19] defined by generators 
and relations. This approach extends without any changes to the case of 
coloured permutations avoiding the given set of consecutive patterns [16]. 
For the original Goulden-Jackson formula, a homological proof can also be 
obtained, using constructions in the spirit of [3]; we do not intend to discuss 
it in detail. 

This paper is organized as follows. In Section 2 we briefly recall the 
Goulden-Jackson cluster method, define "chains" (which give an improved 
version of Goulden-Jackson clusters), and prove the inversion formula. To 
make most of the text accessible to the general mathematical audience, we 
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chose to present the proof in the most elementary way, and use a sign- 
reversing involution instead of talking about boundary maps and chain 
homotopies. 

In Section 3, we consider various applications. Even staying within 
the cluster method, it is possible to solve some problems on consecutive 
pattern avoidance. In particular, we deduce a result in theory of pattern 
avoidance that seems to be new: for a permutation x of length k without 
self-overlaps, the number of permutations of given length with the given 
number of occurrences of t depends only on k, t(1), and z{k). This was 
formulated as a conjecture by Sergi Elizalde [6]. 1 We also show how our 
method applies to patterns of length 4 (obtaining some formulas that seem 
to be new), simultaneous avoidance of some patterns of length 3, and a well 
known result on rises in permutations.. 

In Appendix, we briefly explain the homological algebra behind the 
story, putting our proofs in the context of homological algebra for shuffle 
algebras [19] and ideals of the associative operad [14, 15]. 

The authors wish to thank Sergi Elizalde and Sergey Kitaev for their 
remarks. The work on this paper started when the second author was 
visiting Dublin Institute for Advanced Studies; he expresses his gratitude 
to all the staff there for their hospitality. 

2. Clusters and chains 

2.1. Consecutive pattern avoidance. Let us recall some definitions and 
notation. A permutation of length n is a sequence containing each of the 
numbers 1,. . . , n exactly once. To every sequence s of length k consisting 
of k distinct numbers, we assign a permutation st(s) of length k called the 
standardization of s; it is uniquely determined by the condition that s, < s,- 
if and only if st(s), < st(s)y, for example, st(153) = (132). In other words, st(s) 
is a permutation whose relative order of entries is the same as that of s. We 
say that a permutation o avoids the given permutation t as a consecutive 
pattern if for each i < ] we have st(a,-a,-+i . . . oj) + t, otherwise we say that 
o contains t as a consecutive pattern. Throughout this paper, we only deal 
with consecutive patterns, so the word "consecutive" will be omitted. For 
historical information on pattern avoidance in general and the state-of-art 
for consecutive patterns, we refer the reader to [12, 23]. 

The central question arising in the theory of pattern avoidance is that of 
enumeration of permutations of given length that avoid the given set of 
forbidden patterns P or, more generally, contain the given number of oc- 
currences of patterns from P. This question naturally leads to the following 
equivalence relations. Two sets of patterns P and P' are said to be Wilf 
equivalent (notation: P P') if for every n, the number of P-avoiding 
permutations of length n is equal to the number of P' -avoiding permuta- 
tions of length n. This notion (in the case of one pattern) is due to Wilf [25]. 
More generally, P and P' are said to be equivalent (notation: P ^ P') if for 
every n and every <k < n, the number of permutations of length n with k 

While preparing this paper, we learned that this conjecture is proved in an upcoming 
paper of Jeffrey Remmel, based on methods developed in [17]. We wish to thank Sergey 
Kitaev for informing us of that. 
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occurrences of patterns from P is equal to the number of permutations of 
length n with k occurrences of patterns from P'. 

While studying the equivalence classes of patterns, sometimes it is pos- 
sible to replace the set of forbidden patterns by an equivalent one with 
less patterns in it. Namely we have a partial ordering on the set of all 
permutations (of all possible lengths), namely, t < a if a contains t as a 
consecutive pattern. Given a set P of "forbidden" patterns, to enumerate 
the permutations avoiding all patterns from P, we may assume that P is an 
antichain with respect to this partial ordering. Indeed, ignoring all patterns 
from P that contain a smaller forbidden subpattern does not change the 
set of P-avoiding permutations. Therefore, throughout the paper we shall 
assume that forbidden patterns do indeed form an antichain. 

2.2. Cluster method. The cluster method of Goulden and Jackson [8, 18] is 
a powerful method of enumeration of words and permutations according 
to the number of consecutive occurences of certain patterns. Informally, a 
cluster is a way to link together several patterns from the given set. More 
precisely in the case of permutations ^-clusters relative to the given pattern 
set P are triples (a, ui, . . . , n q , i\, ... , i q ) such that 

- for every k = l,...,q, st(a, 1 , . . . , Oi k+rk -\) - n^, where r k is the length 
of Uk (4 marks the occurrence of the k th pattern tz^); 

- Zjc+i > h (patterns are listed from the left to the right) and Zjt+i <h + r k 
(adjacent patterns are linked); 

- h = 1, and the length of a is equal to l q + r q - 1 (a is completely 
covered by patterns n\,..., n q ). 

Let us denote by n n ^ the number of permutations of length n with exactly 
k occurrences of patterns from P, and by cl„ /(? — the number of (/-clusters 
where the permutation a has length n. We also consider the generating 
functions U{x,t) = T. n ,k n n,k^i k , and C\{x,t) = L M cl M ^fl The following 
enumeration theorem is an immediate consequence of the cluster method. 

Theorem 1 ([8]). We have 

m "fr^ i-x-ekt-D - 

Consequently, the exponential generating function for permutations avoiding pat- 
terns from the set P is 



1 - x - Cl(x, -1)' 

2.3. Chains and series inversion. In this section, we show how to improve 
the cluster method inversion formula for pattern avoidance (2) for a general 
pattern set P. Basically, for patterns without self-overlaps, there is nothing 
to improve: no cancellations happen in the formulas written above. How- 
ever, in general a permutation a can occur in several different ^-clusters for 
different q, which will result in cancellations in Cl(x, -1). We shall explain 
what combinatorial objects correspond to coefficients after these obvious 
cancellations. We still assume that P is an antichain, that is, patterns from P 
are not contained in one another. 



4 



VLADIMIR DOTSENKO AND ANTON KHOROSHKIN 



The main combinatorial concepts we need are (^-chains and their tails. 
They are defined inductively as follows: 

- empty permutation is a 0-chain, it coincides with its tail; 

- the only permutation of one element is a 1-chain, it also coincides 
with its tail; 

- each (/-chain is a permutation a equal to the concatenation a'x, where 
t is the tail of a, and st(a') is a (q - l)-chain; 

- if we denote by %' the tail of a' in the above decomposition, there 
exists a "factorization" t' = afi with st(jSx) e P, and ]St is the only 
occurrence of a pattern from P in t't. 

The way we define the chains here is slightly different from the original 
approach of Anick [1]; the reader familiar with the excellent textbook of 
Ufnarovskii [24] will rather notice similarities with the approach to Anick's 
resolution adopted there. 

Basically, (q + l)-chains are ^-clusters with additional restrictions: only 
neighbours are linked, the first q patterns form an (/-chain, and no proper 
beginning forms an (q + l)-chain (to be precise, one should apply standard- 
ization for the last two properties to make sense). 

Let us give some examples clarifying the notion of a chain. For example, 
if P = {12}, the only n-chain for each n is 12 . . . (n + 1), while if P = {123}, 
we can easily see that 123 is the only 1-chain, and 1234 is the only 2-chain, 
but 12345 is not a 2-chain because it starts from a 2-chain 1234, and is not a 
3-chain because in the only tiling of this permutation by three copies of our 
pattern its first and third occurences overlap: 



Lemma 1. If a is an n-chain, the way to link patterns from P to one another to 
form a is unique. 

Proof. Assume that there are two ways to link n patterns to form a. Obvi- 
ously, for each m, the endpoints of the m th patterns in these two linkages 
should coincide, otherwise we shall find an m-chain whose proper begin- 
ning is an m-chain as well. Once we know that the endpoints of m th patterns 
are the same, the beginnings have to be the same because P is assumed to 
be an antichain (and so patterns from P cannot be contained in one an- 
other). □ 

Let us denote by c n ^ the number of fc-chains in E„; we put 



We also denote by a n the number of permutations in L n avoiding all pat- 
terns from P. Our main result here is that the corresponding exponential 
generation functions are multiplicative inverse to one another. Namely, let 



1 2 3 4 5. 
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Theorem 2. We have 

(3) A(t)C(t) = 1. 

Proof. Let be the set of all permutations y e E„ decomposed as a 
concatenation a A where st(a) is a fc-chain, and A is avoiding all patterns 
from P. We putM = \_\ n ,k^n,k- We shall distinguish different factorisations 
of the same y, so we adopt the notation a | A with a bar between the factors 
for elements of M. 

We define an involution I : M — > Mas follows. We have a = a' z, where z is 
the tail of a, and there are two possibilities: either tA is avoiding all patterns 
from P, or tA = z'nA', where st(7i) e P, and 7i is the leftmost occurrence of a 
pattern from P in tA. In the first case, we put I(a | A) = o' \ zA. In the second 
case, we put I(a | A) = o'z'n | A'. Informally, if it is possible to move the 
tail of a through the bar without creating occurrences of forbidden patterns, 
we do it, and otherwise we extend o to a (k + l)-chain using some elements 
in the beginning of A. It is obvious that I is an involution. Also, it is clear 
that it changes the parity of the parameter k. Thus, for each n > 1, the n th 
coefficient of A(t)C(t), that is 

n v7 n ,k W ' 

is equal to (which implies the formula (3)). Indeed, the number (")c p/ fcfl n _p 
is the number of elements in for which st(a) is a permutation of p 
elements. The above sum of these number with appropriate signs compute 
the difference between the number of such elements where k is even and 
the number of such y's where k is odd. However, our involution establishes 
a bijection between these two subsets, so this difference is equal to 0, as 
required. □ 



3. Examples 

Before moving on to particular results, let us state a general remark. Our 
results suggest that the class of power series that contains all inverses of 
pattern avoidance enumerators is related to some nice combinatorics. Re- 
sults of Elizalde and Noy [7] that we re-prove in Section 3.1 describe some of 
these series as solutions to particular differential equations. Our formulas 
for other cases we considered can be rewritten as more complicated func- 
tional equations. What can be said about other series of that sort? From 
our approach it is always possible to derive recurrence relations, provided 
some additional statistics are taken into account. However, so far we were 
not able to describe a reasonable class of series that cover all of these. For 
example, a wild guess is that all these series satisfy algebraic differential 
equations, that is, if f(x) is such a series, then P(x,f(x),f'(x), . . .,f^ n \x)) - 
for some polynomial P. 

3.1. Applications of the cluster method. 
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3.1.1. Patterns without self-overlaps, unlabelled clusters, and posets. There are 
fairly many situations where clusters are the same as chains. In this section, 
we consider some of these situations, namely the case of an arbitrary pattern 
without self-overlaps, the case of one pattern of length 4, and the pair of 
patterns (132,231). 

Definition 1. A pattern t 6 E m is said to have no self-overlaps if every per- 
mutation of length at most 2m -2 has at most one occurrence of t. (Clearly, 
there always exist permutations of length 2m - 1 with two occurrences of 

T-) 

For example, the pattern 132 is of that form: clearly, we can only link it 
with itself using the last entry. A more general example studied in [7] is 
12 . . . a t {a + 1) e E„, where a + 1 < n, and t is an arbitrary permutation of 
the numbers a + 2, . . . ,n. 

For a pattern t without self-overlaps, there exists a simple way to refor- 
mulate the enumeration problem for clusters in terms of total orderings on 
posets. The first author used this method in [5] in a similar setting, dealing 
with tree monomials in the free shuffle operad. Let form an "unlabelled 
cluster" of the shape that we expect, namely replace temporarily each en- 
try in the expected cluster by the symbol • (a bullet). For example, for the 
pattern 1243 we get 



For such an unlabelled cluster y, let us define a partial ordering on the set 
of bullets of 7 as follows: for each ;', we equip the ; -th bullet pattern with 
a total ordering inherited from the real pattern t. Let us denote by Yl y the 
thus defined poset. 

Example 1. Let us take the bullet pattern above, and replace bullet by letters, 
to make it easier to distinguish between the different bullets: 

a b c d e f g h i j . 

Then the orderings inherited from 1243 are a <b < d < c,d < e < g < f ', and 
g < h < i < j, so we obtain the poset 

/ 
i 

'J 

g 

I 

c e 
\ I 
d 
I 

b 
I 

a 
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(the covering relation of the poset is, as usual, represented by edges; v is 
covered by w if w is the top vertex of the corresponding edge). 

The following proposition is obvious. 

Proposition 1. The set of k-clusters for P = {x}, where x has no self-overlaps is 
in one-to-one correspondence with the set of all total orderings on posets Yl y for 
unlabelled k-clusters y. 

Now we shall see how this approach can be applied in some cases. 

3.1.2. Case of the pattern 12 . . .a x (a + 1). Let a < m, and let 12 . . . a x {a + 1) 
be a permutation in £ m +i which starts from the rise 1,2, ... ,a, followed by 
some permutation x of {a + 2), . . ,,m + 1, followed by the number (a + 1). 
Clearly, this pattern has no self-overlaps, so to enumerate clusters we may 
count total orderings of posets. Note that every fc-cluster for k > 1 is of 
length k(m + 1) - (k - 1) = km + 1. 

Proposition 2. For P = {12 . . . a x (a + 1)}, the number of k-clusters is equal to 



Proof. This proof serves us as a starting example of how to use posets to 
study clusters. The poset which we need to enumerate fc-clusters in this 
case looks like a tree of height m + 1 with the only branch growing on the 
height a + 1, this branch being of length m + 1 and having a smaller branch 
growing at the distance a + 1 from the starting point etc. (An example of such 
a poset for the case of the permutation 1243 with a — 2, m — 3 is given above.) 
To extend such a partial ordering to a total ordering, we should make the 
lowest a + 1 elements for such a tree the smallest elements 1, 2, . . . , a + 1 
of the resulting ordering. Then, there are ( m ~£) ways to choose m - a 
remaining elements forming the stem of our tree, and we are left with the 
same question for a smaller tree, where we may proceed by induction. □ 

Corollary 1 (see [7, 11] for t - 0). For a < m, the multiplicative inverse of the 
generating function Yl counting occurrences of 12... ax (a + 1) e S m+ \ is given 
by the formula 



In particular, all these patterns, for different x, are equivalent to each other. 

Except for the case of the pattern 123 321, this covers all patterns of 
length 3, because 132 312 231 213. We shall deal with the pattern 123 
and, more generally, 12 ... a, in section 3.2. 

3.1.3. Case of one arbitrary pattern without self-overlaps. Generalizing the pre- 
vious result, let us consider an arbitrary pattern x € S m+ \ without self- 
overlaps. Then every fc-cluster for k > 1 is of length km + 1. The following 
result was conjectured in [6], where it was proved in some particular cases. 
Another proof in the general case was, as we were informed by Sergey 



Vl\m-a) 
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Kitaev, obtained by Jeffrey Remmel, the proof being based on methods 
developed in [17]. 

Theorem 3. For a pattern z e S m+ \ without nontrivial self-overlaps, the number 
of permutations of length n with k occurrences of % depends only on m, t(1), and 
z{m + 1). In other words, two non-self-overlapping permutations in S m+ \ are 
equivalent if their first and last entries are the same. 

Proof. This result is also very easy to derive using posets. To make formulas 
compact, let us put a = t(1) - 1 and b - z(m + 1) - 1. The poset whose total 
orderings enumerate ^-clusters is obtained from k totally ordered sets of 
cardinality m + 1 as follows: the element a + 1 of the second set is identified 
with the element b + 1 of the first set, the element a + 1 of the third set is 
identified with the element b + 1 of the second set, etc. Clearly, this poset 
depends only on m, a, and b. 

The actual number of fc-clusters in this case can be computed as follows. 
Let us denote by fk(p) the number of fc-clusters a with a(l) = p + 1. Then it 
is easy to see that the following recurrence relation holds (here we assume, 
without the loss of generality, that a <b): 

Indeed, if we denote q + 1 = a(m + 1), there are (?) ways to choose elements 

less than a (1) in the first pattern in the cluster, ways to choose elements 

greater than o(m + 1) there, (V^Zl) to AH the space between these elements, 
and fk-i(q - b) ways to choose the remaining (k - l)-cluster. □ 

Example 2. Theorem 3 shows that two patterns 23154 and 21534 are equiv- 
alent to each other. Computing the first ten cluster numbers and inverting 
the corresponding series, we get the first ten entries 1, 1, 2, 6, 24, 119, 708, 
4914, 38976, 347776 of the sequence counting permutations that avoid either 
of them. 

3.1 .4. Clusters and chains for patterns of length 4. Let us now consider patterns 
of length 4. The equivalence classes of these are as follows (see [6]): 

I. 1234 = 4321 
II. 2413 3142 

III. 2143 3412 

IV. 1324 = 4231 

V. 1423 3241 =* 4132 =* 2314 

VI. 1342 2431 4213 3124 1432 2341 4123 3214 
VII. 1243 3421 = 4321 = 2134 

The case I will be considered in Section 3.2. In each of the cases VI and 
VII, the pattern has no self-overlaps, so Corollary 1 applies. 

A very special feature of all patterns of length 4 (except for the case I) is 
that they only have self-overlaps of lengths 1 and 2, so patterns in a cluster 
overlap only if they are neighbours. Moreover, in this case chains actually 
coincide with clusters. Let us explain that. In fact, we shall show that even in 
the case of self-overlapping patterns, every labelling of an unlabelled chain 
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that is compatible with ordering of each of the patterns gives a genuine 
chain. Let us show that for the pattern 1324, in other cases the proof is 
similar. Assume that there is a labelling of some unlabelled m-chain c for 
which its proper beginning is a linkage of m patterns as well, and that m is 
the smallest possible integer for which it happens. Then, clearly, the first 
(m - 1) patterns in both c and its beginning are the same, and the linkages 
of the m th pattern with the (m - l) st one differ. However, this would mean 
that the following unlabelled chain has a consistent labelling: 




but for each labelling the orders of fourth and the fifth bullet coming from 
the second and the third bullet pattern contradict one another, which is 
impossible. 

3.1.5. Case of the pattern 1324. 

Theorem 4. The cluster numbers c n j - cl* 3 , 24 for 1324 satisfy the recurrence 



4<2fc+2<« x ' 

with initial conditions = So,*/ c 2,i - 0, c^i - 0. Consequently, the generating 
function for occurrences of 1324 is 



Proof. Counting clusters is reduced to counting total orderings of the corre- 
sponding posets. Let us assume that the first k+1 patterns have two-element 
overlaps, and the following overlap involves just one element. For a cluster 
with a = a\a 2 • • • a 2k+i a 2k+2 a 2k+3 ■ • •/ this means that 

(7) a\ < fl 3 < a 2 < 04/ «3 < a 5 < «4 < «6/ • • • / «2fc-l < a 2k+l < «2Jc < «2ic+2/ 

that {fli, . . . , fl2ic+2l = {1/ • • • / 2A: + 2}, and that st(fl2fc+2«2)c+3 • • •) is an (Z - k)- 
cluster. To prove (6), we notice that the number of permutations a\a 2 . . . a 2 k+2 
of {1, . . . ,2k + 2} for which the conditions (7) are satisfied is given by the 
number of standard Young tableaux of size 2 xfc clearly, a\ = l,a 2 k+ 2 = 2k+2, 
and 



02/«3/ fl 4/ • • -/ fl 2ic+l 



«3 


a 5 


a 7 




a 2k+l 


a 2 


04 


06 




®2k 



gives a bijection with standard Young tableaux. The number of such 
tableaux is equal to the Catalan number ^■( 2 f) (see, for example [22]), 
and the recurrence relation (6) follows. □ 

Example 3. Computing the first ten cluster numbers and inverting the 
corresponding series, we get the first ten entries 1, 1, 2, 6, 23, 110, 632, 4229, 
32337, 278204 of the sequence which is indeed counting permutations that 
avoid 1324 (A113228 in [20]). 



relations 



(6) 





-l 
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3.1.6. Case of the pattern 1423. 

Theorem 5. The cluster numbers c n ^ - cl^ 23 for 1423 satisfy the recurrence 
relations 




4<2fc+2<« 



with initial conditions C\ t i = So,// c 2,i - 0, c 3/ / = 0. Consequently, the generating 
function for occurrences of '1423 is 



Proof. Similarly to the proof of Theorem 4, counting clusters is reduced to 
counting total orderings of the corresponding posets. Let us assume that 
the first k+ 1 patterns have two-element overlaps, and the following overlap 
involves just one element. For a cluster with a = a\a 2 . . . «2Jc+i02/c+2 fl 2ic+3 • • •/ 
this means that 

(9) fli < fl3 < fl4 < a 2 , a 3 < a 5 < a 6 < fl 4/ • • • / fl 2ic-l < a 2k+l < fl 2Jc+2 < fl 2it/ 
SO 

(10) «i < fl 3 < . . . < a 2 k-i < a 2 k+i < a 2 k+2 < «2fc < • • • < «4 < «2/ 

{01,03, . . . , fl 2 fc+i } = {1/ 2, . . . , k+1}, a 2k +2 = k+2, and st(a 2k+2 a 2k+3 . . .) is an (/-fc)- 
cluster. To prove (8), we notice that the number of way to distribute numbers 
between the increasing sequence (10) and the (Z - /c)-chain st(«2fc+2 a 2fc+3 • • •) 
is equal to the number of way to choose the k numbers a^, . . . ,a 2 . The latter 
is clearly the binomial coefficient ("~£~ 2 ), and the recurrence relation (8) 
follows. □ 

Example 4. Computing the first ten cluster numbers and inverting the 
corresponding series, we get the first ten entries 1, 1, 2, 6, 23, 110, 631, 4218, 
32221, 276896 of the sequence counting permutations that avoid 1423. 

3.1.7. Case of the pattern 2143. 

Theorem 6. The cluster numbers c n j = cl 21 ; 43 for 2143 are given by the formula 



with initial conditions Cijip) = 6o,/5i,p, c 2 j(p) - 0, c 3 i(p) - 0. Consequently, the 
generating function for occurrences of '2143 is 




-l 




2<p<n-2 

where the numbers c H/ i(p) satisfy the recurrence relations 



(11) 





-l 
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Proof. Similarly to the proof of Theorem 4, counting clusters is reduced 
to counting total orderings of the corresponding posets. Let c K/ ;(p) be the 
number of Z-clusters for which a is a permutation of length n with a(l) = p. 
Let us assume that the first k + 1 patterns in our cluster have two-element 
overlaps, and the following overlap involves just one element. For a cluster 
with a - a\ai . . . fl2ic+i fl 2it+202it+3 ■ ■ ■, this means that 

(12) fl2 < a l < fl 4 < fl 3/ fl 4 < fl 3 < fl 6 < fl 5/ • • • / fl 2ic < fl 2lc-l < fl 2lc+2 < a 2k+lr 
SO 

(13) fl2 < «1 < #4 < «3 • • • < «2ic < «2fc-l < #2ic+2 < «2Jt+l/ 

and st(fl2ic+2 fl 2/t+3 • • •) is an (/ - ^-cluster. Assume that a.\ - p. To prove (11), 
we notice that if fl2ic+2 = <?/ then there are ^^XZn) ways to pick the numbers 
A3, . . .,fl2ic/ p - 1 ways to pick ai, (n - q) ways to pick «2fc+i/ an d c n -2k-i,i-k 
ways to pick the remaining (Z - Zc)-cluster (where the entry q is (q - 2k) th 
biggest). This completes the proof. □ 

Example 5. Computing the first ten cluster numbers and inverting the 
corresponding series, we get the first ten entries 1, 1, 2, 6, 23, 110, 631, 4223, 
32301, 277962 of the sequence counting permutations that avoid 2143. 

In the last remaining case (II in the list above), we have no trick like 
above that would simplify the computations, so we shall use the most 
general strategy for chain enumeration, which allows to compute the chain 
numbers rather fast (polynomially in n) for all sets of forbidden patterns. 
There is an obvious similarity with the approach of Kitaev and Mansour 
in [13]. 

3.1.8. Case of the pattern 2413. 

Theorem 7. The cluster numbers c n j = cl^ 4 ; 13 for 2413 are given by the formula 

c n ,l= c n,&P>tf)> 

\<p<q-\<n 

where the numbers c n ^{p, q) satisfy the recurrence relations 

(14) c ni i(p,q)= Yj c n _ 2 ,/-i(r,s-l)+ 

r<p<s<q 

+ Yj (p-l)c„- 3/ /-i(r-l,s-l)+ Yj (p-l)c„- 3 ,/-i(r-l,s-2). 

p<r<s<q p<r<q<s 

with initial conditions C2 y i(p,q) = 0, c 3/ i(p,q) = 0, c^i(p,q) = 6;,i 6^,26^,4- Conse- 
quently, the generating function for occurrences of 2143 is 

1-X- > ; 

\ n>2,l>l ) 

Proof. This statement is straightforward. Indeed, let us consider an n- 
cluster with a = a^ao, .... The first pattern in that cluster intersects with 
its neighbour by either two or one elements. In the first case, we have 
03 < «2 < fl 4 < a \, so if we fix a\ and a^, and forget about them, we are left 
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with an (n - l)-cluster, and we should sum over all choices of a$ and a$ for 
its first entries. If, on the contrary the first overlap uses just one element, 
then there are (a\ - 1) choices for as, and we should distinguish between the 
cases a$ > «2 and a$ < a 2 : in the first case a% is the (05 - l) st biggest in the 
remaining cluster, while in the second case it is the (a% - 2) nd biggest. □ 

Example 6. Computing the first ten cluster numbers and inverting the 
corresponding series, we get the first ten entries 1, 1, 2, 6, 23, 110, 632, 4237, 
32465, 279828 of the sequence counting permutations that avoid 2413. 



3.1.9. Case of two patterns {132,231}. The following theorem is mentioned 
in [6]. 



Proof. This pair of patterns has no self-overlaps at all (both for a pattern 
with itself, and two patterns with each other); clearly, clusters are nothing 
but "up-down" permutations, that is permutations a\a 2 . . . a 2 ia 2 i+i for which 
a i < a 2 > a 3 < fl 4 > • • • < a n > a 2l+i ■ h is well known that the number of 
such permutations is equal to the tangent number. The square roots in (15) 
account for the fact that every up-down permutation of length 21 + 1 is an 
/-cluster. □ 

3.2. Applications of the chain method. The cases we consider in this sec- 
tion are some of those where the number of fc-chains is substantially smaller 
than the number of fc-clusters. For example, this happens if the set of for- 
bidden patterns contains the pattern 12... a, which marks rises of length a 
in permutations. 

3.2.1. Case of the pattern 12 ... a. The following result is well known. 

Theorem 9 ([7, 9, 11]). The multiplicative inverse of the exponential generating 
function for patterns avoiding 12 ... a ("permutations without a consecutive rises") 
is given by the formula 



Proof. Indeed, fc-chains for k > 2 are as follows: 

- the only 2-chain is [12 ...a]; 

- the only 3-chain is [12... (a + 1)]; 

- the only 4-chain is [12 . . . (2a)]; 

- the only 5-chain is [12 . . . (2a + 1)]; 



Theorem 8. The cluster number c H/ i = cl , ' is not equal to zero only for 
n = 21 + 1, and in this case is equal to E 2 i + \, the tangent number [22], so the 
generating function for occurrences of {132, 231} is 




(16) 




- the only (2fc)-chain is [12 . . . (ka)\, 

- the only (2k + l)-chain is [12 . . . (ka + 1)]; 
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□ 



Our next result classifies chains for {123, 132}. It is interesting to compare 
it with the result of Claesson [2] stating that the number of permutations 
of length n avoiding {123, 132} is equal to the number of involutions of 
length n. 

3.2.2. Case of two patterns {123, 132}. 

Theorem 10. The numbers c n j ofl-chains of length n for {123,132} satisfy the 
recurrence relations 



with initial conditions c 1} = 6^/, c 2 ,i - 0. Consequently, the exponential generat- 
ing function for permutations avoiding {123,132} is 



Proof. The permutation 132 has no self-overlaps with itself, and have one 
nontrivial overlap with 123. It is easy to see that every /-chain for this 
pair of permutations is either an Z-chain for 123 or, for some k < I it starts 
from a (k - l)-chain for 123 which overlaps with 132 (by one or two entries, 
depending on the parity of k, as for the single pattern 12 ... a in Theorem 9), 
which overlaps by its last entry with an (Z - fc)-chain. The only parameter 
here that varies is the entry in the place of 3 for 132. Taking that into account, 
we obtain the above recurrence relation. In it, the first summand counts 
chains for 123, while the two sums take care of the segment to the first 
occurrence of 132 for both possible parities. □ 

Example 7. Computing the first ten chain numbers and inverting the cor- 
responding series, we recover the first ten entries 1, 1, 2, 4, 10, 26, 76, 232, 
764, 2620 of the sequence counting involutions (A000085 in [20]). 

4. Appendix: a homological interpretation of the main result 

In this section, we discuss another interpretation of the formula (3), 
putting our bijection I in the context of homological algebra for shuffle 
algebras [19] and of modules over the associative operad [14, 15]. 

4.1. Shuffle algebras. A shuffle algebra, as defined in the paper of Maria 
Ronco [19], is a graded vector space A = (£) k>0 A k together with maps 



subject to certain associativity condition. Here Sh(n, m) denotes the set of 
all (n, m)-shuffles, that is permutations a e S n+m for which a(l) < . . . < a(n) 
and o(n + 1) < . . . < o(n + m). 

One can slightly re-phrase this definitition. Define a new monoidal struc- 
ture on graded vector spaces as follows: if A - (£) k>0 A k and B - (J) k>Q B k 




3<3k<n 



1<3k+l<n 




-1 



y : A n <g> A m <g> kSh(n, m) -> A, 
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are two graded vector spaces, we put 

n 

(18) (A bB)„ = Ai <8> B n -i ® kSh(i, n - i). 

This defines a monoidal structure on graded vector spaces, and a shuffle 
algebra is a monoid in this category. The advantage of this approach is that 
arbitrary constructions and definitions available for monoidal categories 
(modules, homology, resolutions) enter our story for free. 

Within the framework of shuffle algebras, our results of Section 2.3 should 
be stated as follows. It is easy to see that the graded vector space ©„ >0 0<S„ 
is a shuffle algebra; moreover, this shuffle algebra is a free shuffle algebra 
with one generator of degree 1 [19]. 

Let A be the graded vector space whose n th component A n is spanned 
by all permutations avoiding permutations from the given set P. In other 
words, A is the quotient of the free shuffle algebra with one generator by the 
two-sided shuffle ideal generated by all permutations from P. We shall now 
exhibit a free right module resolution of it trivial right module. Let C be the 
graded vector space whose n th component C„ is spanned by n-chains. Using 
the involution I, one can endow the graded space C H A with a structure of 
a chain complex. The differential d of this complex maps a basis element 
a | A (in the notation of Section 2.3) to 0, if I moves the bar in this element 
to the right, and to I(a \ A) if I moves the bar to the left. Clearly, d 2 - 0, and 
this complex is acyclic in positive degrees, so it is a resolution of the trivial 
module. Computing Euler characteristics of its graded components results 
in the equation (3). 

4.2. Modules over the associative operad. This last section is intended for 
those readers whose intuition, as it is for us, comes from the operad theory. 
Essentially, it re-tells the shuffle algebra approach in a slightly different way, 
explaining also the place for classical pattern avoidance in the story. 

Studying varieties of algebras, that is, algebras satisfying certain identi- 
ties, goes back to works of Specht [21]. The notions of T-ideals and T-spaces 
formalize the ways to derive identities from one another. One natural way 
to study identities is to define an analogue of a Grobner basis for an ideal 
of identities. This approach is taken in works of Latyshev [14, 15] who 
suggested a combinatorial approach to study associative algebras with ad- 
ditional identities via standard bases of the corresponding T-spaces. His 
approach can be described as follows. For each "T-space" (in other words, 
right ideal in the associative operad), he defines a version of a Grobner 
basis; such a basis would allow to study arbitrary relations via monomials 
avoiding certain patterns. Here, for once, by a pattern we mean a classi- 
cal pattern (its occurrence does not have to be as a consecutive sub word). 
This approach has a slight disadvantage. Namely, even though the actual 
Grobner bases of relations are expected to be finite (at least, the famous 
result of Kemer [10] states that in principle there exists a finite set of gen- 
erating identities), they are difficult to compute, as there is no algorithm 
comparable to the one due to Buchberger in the associative algebra case [24]. 
Somehow, this trouble disappears if we study left ideals in the associative 
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operad. In terms of divisibility of monomials, left and right ideals lead 
to two very different combinatorial problems: right divisibility translates 
into occurrences of classical patterns, whereas left divisibility yields occur- 
rences of consecutive ones! For consecutive patterns, the intuition of [4, 3] 
for Grobner bases and resolutions applies directly, and our results can be 
interpreted in terms of appropriate free left module resolutions. Let us re- 
mark that the difference between chains and clusters is exactly the same as 
the difference between the Anick's resolution for associative algebras and 
the not-quite-minimal resolution constructed in [3]; in this framework we 
managed to obtain an explicit canonical minimal resolution, unlike what 
happens in the general operadic case. 
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